Hierarchical Matrix Timestamps for Scalable UpdatePropagation

نویسندگان

  • T. Johnson
  • K. Jeong
چکیده

Update propagation is a technique for the weakly consistent replication of a database. Operations are performed at the local copy of the database, then are asynchronously propagated to the replicas. In large scale applications where the weak consistency can be tolerated (e.g., network routing tables, bulletin boards etc.), update propagation can give much better performance than conventional database techniques. However, standard algorithms for ensuring that an update is eventually propagated to all replicas require that each site store and communicate an O(N 2) matrix timestamp, where N is the number of sites that store a replica. If N is very large (e.g. N = 10; 000 or larger), then the cost of maintaining the matrix timestamp limits the scalability of the replicated database. Also, standard algorithms require global membership information, and keeping the membership information up-to-date can be diicult. In this paper, we present an algorithm that can signiicantly reduce the size of the matrix timestamp. The sites that store a replica of the database are partitioned among a set of domains. We replace matrix timestamps with hierarchical matrix timestamps, which have precise information about processors in the same domain, and summary information about other domains. If there are O(p N) domains, the hierarchical matrix timestamp can require only O(N) space. Furthermore, the hierarchical matrix timestamp allows hierarchical replica management. The cost of the reduced space of the hierarchical matrix timestamp is a lower quality estimate of the progress of the other processors. As a consequence, updates must be buuered for a longer time to ensure safety, increasing the size of the update log. We present detailed simulation studies that quantify this tradeoo. We explore a set of optimization techniques for reducing the average log size. When all of the optimizations are applied, there is no increase in the log size as compared to the regular update propagation algorithm.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Hierarchical Cluster Algorithm for Dynamic, Centralized Timestamps

Partial-order data structures used in distributed-system observation tools typically use vector timestamps to efficiently determine event precedence. Unfortunately, all current dynamic vector-timestamp algorithms either require a vector of size equal to the number of processes in the computation or require a graph search operation to determine event precedence. This fundamentally limits the sca...

متن کامل

A Scalable Partial-Order Data Structure for Distributed-System Observation

Distributed-system observation is foundational to understanding and controlling distributed computations. Existing tools for distributed-system observation are constrained in the size of computation that they can observe by three fundamental problems. They lack scalable information collection, scalable data-structures for storing and querying the information collected, and scalable information-...

متن کامل

Decoding order recovery for multi flow transmission of scalable video coding (SVC) over mobile IP channels

Recent standardization activities for the transport of Scalable Video Coding over multimedia networking channels have introduced means to differentiate layers of a scalable media in transport. Therefore in RTP, those layers are transported in different RTP flows respectively RTP sessions. To decode such layers transported on different IP network flows, a decoding order recovery mechanism is req...

متن کامل

Self-Organizing Hierarchical Cluster Timestamps

Distributed-system observation tools require an efficient data structure to store and query the partial-order of execution. Such data structures typically use vector timestamps to efficiently answer precedence queries. Many current vector-timestamp algorithms either have a poor time/space complexity tradeoff or are static. This limits the scalability of such observation tools. One algorithm, ce...

متن کامل

Cs - 2001 - 01 I Ssues in S Calable D Istributed - S Ystem M Anagement

Distributed-system management concerns the observation of a distributed computation and then using the information gained by that observation to control the computation. This necessitates collecting the information required to determine the partial order of execution, and then reasoning about that partial order. This in turn requires a partial-order data structure and, if the reasoning is being...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996